OCA: Opinion corpus for Arabic

Authors

  • Mohammed Rushdi-Saleh
  • Maria Teresa Martín-Valdivia
  • Luis Alfonso Ureña López
  • José Manuel Perea Ortega
Abstract

Sentiment analysis is a challenging new task related to text mining and natural language processing. Although there are, at present, several studies related to this theme, most of these focus mainly on English texts. The resources available for opinion mining (OM) in other languages are still limited. In this article, we present a new Arabic corpus for the OM task that has been made available to the scientific community for research purposes. The corpus contains 500 movie reviews collected from different web pages and blogs in Arabic, 250 of them considered as positive reviews, and the other 250 as negative opinions. Furthermore, different experiments have been carried out on this corpus, using machine learning algorithms such as support vector machines and Naïve Bayes. The results obtained are very promising and we are encouraged to continue this line of research.
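As a rough illustration of the kind of polarity classification mentioned in the abstract, the following is a minimal sketch of a multinomial Naïve Bayes classifier with add-one smoothing. It is not the authors' implementation: the abstract does not describe the features or preprocessing used, so whitespace tokenization, the function names `train_nb` and `predict_nb`, and the four toy reviews (invented here, not taken from OCA) are all assumptions made for the example.

```python
import math
from collections import Counter


def train_nb(docs):
    """Fit a multinomial Naive Bayes model on (tokens, label) pairs.

    Hypothetical helper for illustration only.
    """
    label_counts = Counter(label for _, label in docs)
    word_counts = {label: Counter() for label in label_counts}
    for tokens, label in docs:
        word_counts[label].update(tokens)
    vocab = {w for counts in word_counts.values() for w in counts}
    return label_counts, word_counts, vocab


def predict_nb(model, tokens):
    """Return the label with the highest log-posterior for the tokens."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        # Log prior from class frequencies.
        score = math.log(label_counts[label] / total_docs)
        # Add-one (Laplace) smoothing over the training vocabulary.
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokens:
            if tok in vocab:  # words never seen in training are skipped
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy reviews (assumption): two positive, two negative,
# mirroring the balanced positive/negative split of the corpus.
TOY_REVIEWS = [
    ("فيلم رائع وممتع".split(), "positive"),
    ("قصة جميلة وتمثيل رائع".split(), "positive"),
    ("فيلم ممل وسيئ".split(), "negative"),
    ("قصة ضعيفة وتمثيل سيئ".split(), "negative"),
]

model = train_nb(TOY_REVIEWS)
print(predict_nb(model, ["فيلم", "رائع"]))
print(predict_nb(model, ["ممل"]))
```

A real experiment on the 500 reviews would hold out part of the data for evaluation (for example with cross-validation) rather than classifying words already seen in training.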

Similar resources

Bilingual Experiments with an Arabic-English Corpus for Opinion Mining

Recently, Opinion Mining (OM) is receiving more attention due to the abundance of forums, blogs, ecommerce web sites, news reports and additional web sources where people tend to express their opinions. There are a number of works about Sentiment Analysis (SA) studying the task of identifying the polarity, whether the opinion expressed in a text is positive or negative about a given topic. Howe...

Investigation of the Feature Selection Problem for Sentiment Analysis in Arabic Language

Sentiment analysis, which is also known as opinion mining, can be defined as the process of the automatic detection of the attitude of an author towards a certain subject in textual contents. In this study we design and implement a document-level supervised sentiment analysis system for Arabic context and investigate its performance. We use three different feature extraction methods in order to...

CRF-based Arabic Opinion Summarization System

This paper presents the study that we have carried out to investigate supervised opinion summarization in Modern Standard Arabic. We use a corpus of news articles. We use conditional random fields (CRF) as machine learning technique. We investigate some features to identify those that allow achieving the best results. Our contribution is to use opinion specific features to summarize Arabic news...

HesNegar: A Persian Sentiment WordNet

Awareness of others' opinions plays a crucial role in the decision making process performed by simple customers to top-level executives of manufacturing companies and various organizations. Today, with the advent of Web 2.0 and the expansion of social networks, a vast number of texts related to people's opinions have been created. However, exploring the enormous amount of documents, various opi...

Opinion Changing Aversion Functions for Group Settlement Modeling

Opinion changing aversion (OCA) functions are used to quantify the decision makers’ resistance to opinion changing. By introducing OCA functions of polynomial form we will show that if each expert has a quadratic opinion changing aversion function then the minimum-cost solution is nothing else but the weighted average of the individual optimal solutions where the weights are the relative import...

Journal:
  • JASIST

Volume 62

Publication date: 2011